Estimating recursive query costs for various parallel environments
نویسندگان
چکیده
We introduce a recursive query evaluation method whose main goal is to obtain recursive query cost estimate without performing the query. In particular, we show that the execution estimate of the parallel processing of recursive queries can be done by exploiting the characteristics of the database profile during the evaluation process for request optimization. As many recursive queries involve computation of a transitive closure, the method estimates both transitive closure sizes and transitive closure execution costs. 1 Introduction To support such applications as integrated office systems, design and engeneering, or expert systems, database systems of the new generation must support complex objects and /or deduction as well as offer extensibility. In addition , these systems must deal with a wide range of computer architectures, and provide for parallelism and adaptability. Coupling the support of recursive queries on complex objects, the use of parallelism, extensiblity, and adaptability requires a new approach to database system design. A possible way to achieve this goal is to embed a large part of the coupling in a query-cost evaluation tool. This paper describes such a tool and focuses on cost evaluation techniques for complex recursive queries. ACE, an adaptative query-cost evaluator tool designed at the Bull Corporation Research Center [AND90a], provides extensiblity and adaptability through its library-oriented design. Four librairies provide parameters and formulas describing architectures, relational operations tool, it is possible to study the effect of chosen environment parameters on the cost of given queries, while keeping the values of the other environment parameters b e d. Thus, questions such as: " what is the best algorithm for a given parallel architecture ?", or "what is the best architecture for a given application (database + queries) ?", can be answered. ACE provides a means of experimentally evaluating new designs without having to actually build a DBMS to do so. For ACE to accept query trees involving a transitive closure operation it was necessary to define a methodology for the estimation of re-cursive query costs in the context of various parallel environments. Several efficient parallel transitive closure algorithms have been proposed to take advantage of parallel architectures. They can be classified in two classes: those algorithms that assume a shared memory architecture and those that assume a distributed memory architecture. In the first class, we mainly find parallel transitive closure algorithms as Warren and adaptations of Warren algorithms (for example [LU87]). The two main parallel …
منابع مشابه
Optimization Strategies for Parallel Linear Recursive Query Processing
Query optimization for sequential execution of non-recursive queries has reached a high level of sophistication in commercial DBMS. The successful application of parallel processing for the evaluation of recursive queries will require a query optimizer of comparable sophistication. The groundwork for creating this new breed of query optimizer will consist of a combination of theoretical insight...
متن کاملObject-Oriented Programming with Recursive Queries
Recursive queries are required in many object-oriented database applications. Among them we can mention Bill-Of-Material (BOM), various kinds of networks (transportation, telecommunication, etc.), processing semistructured data (XML, RDF), and so on. The support for recursive queries in current query languages is limited and lacks theoretical foundations. In this paper we present recursive quer...
متن کاملA Parallel Processing Strategy for Evaluating Recursive Queries
The set of resolvents generated by a recursive intension in a lirst-order database is treated as a set of concurrent database queries. A strategy for egiciently ev,aluating these concurrent queries in a multi-processor environment is presented. The strategy combines three query processing techniques, namely, query decomposition, intermediate result sharing and data-flow and pipelined query exec...
متن کاملMap Combine Map Task Split HDFS file K 1 , N 1 ( a ) Reduce Task { P 2 } { P 1 } { P 3 }
In parallel query-processing environments, accurate, time-oriented progress indicators could provide much utility to users given that queries take a very long time to complete and both interand intra-query execution times can have high variance. In these systems, query times depend on the query plans and the amount of data being processed, but also on the amount of parallelism available, the ty...
متن کاملThe Application of Recursive Mixed Models for Estimating Genetic and Phenotypic Relationships between Calving Difficulty and Lactation Curve Traits in Iranian Holsteins: A Comparison with Standard Mixed Models
In the present study, records on 22872 first-parity Holsteins collected from 131 herds by the Animal Breeding and Improvement Center of Iran from 1995 to 2014 were considered to estimate genetic and phenotypic relationships between calving difficulty (CD) and the lactation curve traits, including initial milk yield (Ap), ascending (Bp) and descending (Cp) slope of the lactation curves, peak mil...
متن کامل